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Abstract 

Background: Our current understanding of evolution is so tightly linked to template-dependent replication of 
DNA and RNA molecules that the old idea from Oparin of a self-reproducing 'garbage bag' ('coacervate') of 
chemicals that predated fully-fledged cell-like entities seems to be farfetched to most scientists today. However, 
this is exactly the kind of scheme we propose for how Darwinian evolution could have occurred prior to template 
replication. 

Results: We cannot confirm previous claims that autocatalytic sets of organic polymer molecules could undergo 
evolution in any interesting sense by themselves. While we and others have previously imagined inhibition would 
result in selectability, we found that it produced multiple attractors in an autocatalytic set that cannot be selected 
for. Instead, we discovered that if general conditions are satisfied, the accumulation of adaptations in chemical 
reaction networks can occur. These conditions are the existence of rare reactions producing viable cores 
(analogous to a genotype), that sustains a molecular periphery (analogous to a phenotype). 

Conclusions: We conclude that only when a chemical reaction network consists of many such viable cores, can it 
be evolvable. When many cores are enclosed in a compartment there is competition between cores within the 
same compartment, and when there are many compartments, there is between-compartment competition due to 
the phenotypic effects of cores and their periphery at the compartment level. Acquisition of cores by rare chemical 
events, and loss of cores at division, allows macromutation, limited heredity and selectability, thus explaining how 
a poor man's natural selection could have operated prior to genetic templates. This is the only demonstration to 
date of a mechanism by which pre-template accumulation of adaptation could occur. 

Reviewers: This article was reviewed by William Martin and Eugene Koonin. 

Keywords: origin of life, prebiotic evolution, chemical evolution, catalytic reaction networks, autocatalytic sets, 
replicators, protocells, metabolism-first theory of origin of life 



Background 

There are two camps in the origin of life. The metabo- 
lism-first camp advocates consider improbable that 
RNA-like self-replicating polymers appeared before nat- 
ural selection had operated on chemical networks [1-3], 
whereas genetics-first supporters find implausible the 
idea that molecular networks without genetic control 
could have undergone Darwinian evolution [4]. This 
Gordian knot was obviously cut on Earth around 3.5 bil- 
lion years ago or even earlier [5]. A solution to the con- 
undrum can be found in general evolutionary principles 
shared by some chemical and biological systems. 
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A unifying theme in both camps has been self-sus- 
tained autocatalysis: Nature's ability to generate copies 
of a given entity leading to multiplication. It boosted the 
concentration of required molecules in Darwin's warm 
little pond' [6] (or any relevant environment for chemi- 
cal evolution), and allowed the increase of replicating 
(reproducing) units in a geometrical progression that 
makes the 'survival of the fittest' possible [7,8]. Dyson 
[9,10] and Kauffman [1,11] had developed some early 
ideas on how sets of mutually autocatalytic biopolymers 
could undergo replication, even if none of the compo- 
nents were individually autocatalytic. The central thesis 
in this scenario was that template replication is not 
required to achieve an autocatalytic set [12]. The various 
- and sometimes inconsistent - definitions of autocataly- 
tic sets are reviewed in Additional file 1. This paper out- 
lines a general approach that enables a more rigorous 
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study of the structure and evolutionary potential of che- 
mical reaction networks [13]. 

Originally autocatalytic sets of polymers [1,11] were 
introduced to explain how self-organization could pro- 
vide a complex system upon which natural selection 
could act. It has been suggested that "...some autocataly- 
tic sets will reproduce more rapidly than others and 
hence will have higher Darwinian fitness" and, therefore, 
"we have evolution without a genome"([l], pp. 332-333). 
However, no rigorous analysis of the putative evolvabil- 
ity of autocatalytic sets (but see [4] for a demonstration 
of the absence of Darwinian evolution in other models) 
has been carried out so far. For example, Bagley and 
Farmer [14] suggest that reaction networks provide a 
simple model for studying evolution with an emergent 
notion of fitness (i.e., they are clearly referring to Darwi- 
nian evolution), but subsequently [15] they used the 
word evolution in the Spencerian sense of change, 
rather than of the actual accumulation of adaptations, 
which is what we are really interested in. 

At the time of writing then, we are still a long way 
from knowing whether autocatalytic sets offer a plausi- 
ble model for the emergence of evolvability. The impor- 
tance of such an investigation is emphasized by recent 
findings showing that conceptually similar models [16] 
failed to pass the ultimate test [4]. As John Maynard 
Smith [17] put it: a population of units of evolution is 
"any population of entities with the properties of multi- 
plication (one entity can give rise to many), variation 
(entities are not all alike, and some kinds are more likely 
to survive and multiply than others), and heredity (like 
begets like) will evolve. A major problem for current 
evolutionary theory is to identify the relevant entities". 
Can autocatalytic sets, as originally conceived, be units 
of evolution? Our answer is no. However, using the 
minimal model of polymer chemistry as a popular 
example, we outline the very general requirements that 
enable reaction networks to act as units of evolution, 
whatever the underlying chemistry used. We show that 
very simple and general network level constraints must 
be satisfied for evolvability in chemical networks, thus 
justifying the utility of such an abstract model. 

In terms of "real" chemistry we have been inspired by 
the concept of autocatalytic protein networks [1,10,18], 
but in principle several different polymer species (even 
RNA) could potentially realize such networks. In fact, a 
similar evolutionary mechanism has even been demon- 
strated in a model of random bimolecular rearrange- 
ments rather than ligation and cleavage reactions, thus 
extending the scope of the mechanism proposed here 
[19,20]. One may object to the motivation of the present 
paper by simply pointing out that no autocatalytic poly- 
mer networks have been shown to exist yet, let alone to 
evolve. This criticism is not completely fair, since we 



have the autocatalytic networks of peptides realized by 
Ghadiri and Ashkenazy [21] (although there a direct 
templating effect as in [22] plays a crucial role, while 
the polymer chemistry we assume here does not rest on 
direct template replication). In any case, it is a fact that 
amino acids and protein(-like) polymers readily form in 
various prebiotic experiments, and random polypeptides 
show all kinds of spontaneous enzymatic activity, 
whereas we certainly do not know where the RNA 
world came from. What we are interested in is whether 
spontaneous enzymatic activity of such polymers can 
"ignite" reflexively autocatalytic networks, and whether 
these networks can undergo evolution by natural selec- 
tion. This theoretical analysis is not a substitute for 
badly needed experimental work, not even for more 
detailed theoretical analyses: it is a first step in a (sur- 
prisingly) promising direction, and aims to stimulate 
experimental and further theoretical work. 

The first part of the paper examines the original 
claims made about autocatalytic sets, discusses earlier 
criticisms of these models, and identifies various chemi- 
cal organizations within the reaction network. We focus 
on an entity we call the autocatalytic core. The second 
part dismisses the putative evolvability of autocatalytic 
sets per se, but proves the capacity of certain special 
kinds of chemical reaction networks, i.e. those contain- 
ing multiple autocatalytic cores enclosed in compart- 
ments, to sustain selectable hereditary variation. It is 
essential to realize that there has been no demonstration 
previously of how or even whether chemical reaction 
networks could accumulate adaptations. 

Results and discussion 

Anatomy of an autocatalytic set 

Kauffman ([11] pp. 2-3) defines an autocatalytic set as 
an arrangement of molecules in which "every member 
of the autocatalytic set has at least one of the possible 
last steps in its formation catalyzed by some member of 
the set, and that connected sequences of catalyzed reac- 
tions lead from the maintained 'food set' to all members 
of the autocatalytic set". This is more formally defined 
by Hordijk and Steel [23,24], who state that an R (sub) 
set of reactions is called (i) reflexively autocatalytic (RA) 
if every reaction in R is catalyzed by at least one mole- 
cule involved in any of the reactions in R, (ii) food-gen- 
erated (F) if every reactant in R can be constructed from 
a small "food set" by successive applications of reactions 
from R, and (Hi) reflexively autocatalytic and food-gen- 
erated (RAF) if both RA and F. 

The concept of an autocatalytic or RAF set, although 
important for questions of self-organization, does not 
directly address heredity or selectability [24]. However, 
such a set can be divided into one or more strongly 
connected autocatalytic cores and their peripheries, and 
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Figure 1 Classification of various network modules within autocatalytic sets. food1-food6: food set that is assumed to be present at all 
times, A-D: non-food species generated by ligation/cleavage reactions. Solid lines: reactions; dotted lines: catalytic activities. Orange dotted lines 
show the superimposed autocatalytic loops. (A) Viable autocatalysts (A in all three examples) are the necessary units needed for exponential 
growth of an autocatalyst, in contrast to suicidal autocatalysts (B in all three examples) that use reactants only produced by the autocatalytic 
reaction itself. (8) A molecular species can be directly autocatalytic, forming a one-member autocatalytic loop, or several species can form loops 
of various sizes that result in indirect autocatalysis. (0 A loop is autocatalytic - and able to grow exponentially - as long as at least one of the 
steps is a catalytic dependency. Therefore a loop can be made of solely catalytic or mixed couplings. (D) An autocatalytic core contains one or 
more linked loops. Note that any member of a core (A and B in all three examples) is sufficient to act as a seed for the core. Several distinct 
cores can form within a catalytic reaction network. Some can exist independently of other cores, while dependent cores rely on others as food 
supply or catalysts. (£) An autocatalytic core is typically associated with a periphery that is dependent on the core (C and D in first example). It is 
also possible that a molecular periphery appears only if two or more cores are present (D in second example). We propose that autocatalytic 
cores are the units of heritable adaptation in chemical networks. 
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Figure 2 Multiple cores result in selectable attractors for a chemical network, food 1 -food 1 1: food set that is assumed to be present at all 

times, A-H: non-food species generated by ligation/cleavage reactions. Solid lines: reactions; dotted lines: catalytic activities. Orange dotted lines 

show the superimposed autocatalytic loops. Structural considerations: Autocatalytic sets can contain several distinct autocatalytic units, each of 

which can be divided to a core of autocatalytic molecules and a periphery. Here, two independent cores are shown. The first consists of the two 

linked loops A A and A B A. The second core includes the two linked loops C -» C and C -» D E C, with the periphery of F and 

G. H is the shared periphery of the two cores that requires both for its production. Dynamical considerations: This platonic reaction network can 

manifest in four possible stable compositions of the core-periphery units: (i) no cores (only food species); (ii) only first core (A, B; yellow area); (iii) 

only second core (C, D, E, F, G; blue area); (iv) both cores (all species). Now imagine that we have a compartment that only contains food 

species, but rare uncatalyzed reactions among them are possible. The uncatalyzed appearance of any one molecule of core species is sufficient 

to produce all the core species and the periphery species of that core, e.g. either A or B for the first core and either C, D, E for the second. Now 

let us assume that after reaching a certain size a compartment that contains both cores will split and produce propagules. If neither C, D or E is 

present in the daughter compartment, the second core is lost and the remaining molecules of its periphery will be washed out from the 

compartment. Discovering cores by rare reactions, and losing cores by segregation instability opens up the possibility for a chemical reaction 

network to respond to natural selection, 
k J 



we propose that these cores are the units of heritable 
adaptations in reaction networks (for chemical network 
motifs see Figure 1 and for a specific example see Figure 
2). A core can be viewed as a chemical network geno- 
type and its corresponding periphery as a chemical net- 
work phenotype (although without a modular or 
compositional mapping between them). An autocatalytic 
core (which we abbreviate from now on to 'core') con- 
tains one or more linked autocatalytic loops [18]. Auto- 
catalytic loops are closed circular paths of any length 
where each molecule in the loop depends on the pre- 
vious one for its production (Figure 1). In the core all 
species catalyze the production of all other species, 
including themselves, which means that they are indir- 
ectly autocatalytic (Figure 1). The periphery consists of 
molecular species that are catalyzed by the core (Figure 
1). The provision of any one molecule of core species is 
sufficient to produce all the core species and the periph- 
ery species of that core; in other words, all core mole- 
cules contain the information that is necessary for 
igniting and sustaining the autocatalytic core and per- 
iphery and can therefore act as an autocatalytic seed. 



This is not the case for periphery molecules that 
depend, as a phenotype does on its genotype, upon the 
core. Note that an autocatalytic or RAF set as defined 
above can contain any number of distinct core-periphery 
units (Figure 1) and the structurally and kinetically pos- 
sible combinations of such units define different alterna- 
tive stable states of the same chemical network (Figure 
2). 

Having defined an autocatalytic core as a set of con- 
nected autocatalytic loops, it is important to distinguish 
between the possible types of such loops (Figure 1). 
Typically, the cycle of reactions is coupled by catalytic 
dependencies. However, as Eigen [18] has shown, the 
cycle maintains its autocatalytic properties as long as at 
least one of the steps is catalytic (an idea that was 
missed by previous models in [11,23]). All other steps 
can be substrate dependencies where the product of the 
previous step serves as a precursor for the next reaction 
(e.g. consider the dependency of Y upon X in the reac- 
tion A + X < — > Y; Figure 1). In other words, one reac- 
tion where the product is not consumed but serves as a 
catalyst is enough for the exponential increase of the 
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mass of the cycle. It can be misleading to solely focus 
on directly or indirectly autocatalytic molecules also, 
because of the possibility of what we call suicidal auto- 
catalysts [13]. An autocatalytic molecule can be suicidal 
in a kinetic sense. Bearing in mind that all reactions in 
autocatalytic sets of biopolymers [11] are assumed to be 
reversible, let us consider the simple autocatalytic reac- 
tion A + X < — > 2A. If X is not present (or present in 
very low concentration) this reaction will go in the 
direction of self-decomposition, in a form of autoinhibi- 
tive cycle (Figure 1). Such suicidal autocatalysts are 
obviously incapable of exponential growth, the very fea- 
ture that gives autocatalysis its evolutionary significance. 
A viable autocatalytic molecule, either directly autocata- 
lytic or embedded in a longer autocatalytic loop, how- 
ever, does grow exponentially. Note that rather similar 
examples of viable and suicidal autocatalysts can be 
found in contemporary biochemistry: whereas the Calvin 
cycle is a network of autocatalytic sugar production, the 
pentose phosphate pathway is an example of autoinhibi- 
tive decomposition of sugar phosphates [25]. Viable 
autocatalytic loops are necessary but not sufficient for 
evolution by natural selection of autocatalytic networks, 
as we shall see below. 

Spontaneous formation of autocatalytic sets in a polymer 
chemistry 

The original mathematical model of autocatalytic sets 
[11] assumes the following: (/) there exists a large food 
set of abundant polymers naturally formed in the envir- 
onment up to some low level of complexity, i.e. up to 
length M consisting of B types of monomers (e.g. a, b, 
aa, bb); (ii) each molecule has a certain probability P of 
catalyzing each ligation-cleavage reaction. The model 
assumes infinite discriminability, in other words, a mole- 
cule either does or does not catalyze a particular reac- 
tion without quantitative variation in efficiency. 
However, it does not assume specificity, because a cata- 
lyst typically catalyses a number of reactions (on aver- 
age, P fraction of the possible reactions). It was 
demonstrated that above a certain catalytic probability 
threshold a chain reaction is triggered and due to cataly- 
tic closure autocatalytic sets appear [11]. 

Hordijk and Steel [23] verified this claim by generating 
random networks of reversible ligation/cleavage reac- 
tions between strings up to length n - 20, where each 
molecule had the probability P of catalyzing each reac- 
tion. At low values of P they found unconnected sets 
utilizing separate food sets, but at higher values a perco- 
lation phenomenon produced fully connected autocata- 
lytic sets. Farmer and co-workers [12,14] were the first 
to implement the original mathematical model and con- 
firmed that a supracritical reaction network that keeps 
growing with accelerating speed arises above a certain 



catalytic probability P c . By constraining the growing cat- 
alytic reaction network in a flow reactor with finite 
mass and lower bound concentration threshold (the 
relevant scenario being here to study the issue of evolva- 
bility in compartmentalized systems), they implemented 
a chemical model where the size of the chemical net- 
work shows logistic growth above P c . Our initial task 
was to corroborate these results and to investigate the 
underlying structure of the catalytic reaction network 
(the methods are described in detail in Additional file 
1). We found that as the networks grow in size they 
form one large autocatalytic core consisting of all mole- 
cular species above the concentration threshold. How- 
ever, autocatalytic cores mostly consist of suicidal 
autocatalysts as only a small minority of autocatalytic 
species use valid reactants in the autocatalytic reactions 
and thus form viable loops (see Figure 1). Note that 
although the number of viable loops increases with sys- 
tem size, they are always within the same viable core 
and therefore they cannot be independent targets for 
natural selection (see Additional file 1). In conclusion, 
we substantiate the speculation [1] that a self-sustaining 
network of reactions -an autocatalytic primitive metabo- 
lism- appears in this minimal model of polymer chemis- 
try (Figure 3). 

Our next task was to verify that the previous claim 
remained true in the face of earlier criticisms of the 
model. Thus, a critical parameter in the model is the 
probability P that each molecule can catalyze each liga- 
tion-cleavage reaction, which was assumed to be con- 
stant. This assumption led to the serious objection that 
the model implied an unrealistically high probability (of 
one) that a peptide could act as a catalyst [26]: when 
the maximum length M of polymers in the set increases, 
the number of reactions (((M-2) x 2 M+1 ); [11]) increases 
faster than the number of molecules (N « 2 M+1 ), there- 
fore all molecules quickly become catalytic - an outcome 
that is clearly very unrealistic. To remedy the situation, 
the parameter P should be a composite of two probabil- 
ities: the probability P 1 that the molecule is a catalyst, 
and the probability P" that a molecule catalyzes a given 
reaction [26]. We implemented this criticism, but with 
the caveat that it is unlikely that a catalyst is expected 
to catalyze only one out of the infinite possible reactions 
as suggested by Lifson [26], following assumptions in 
([1] p. 306). That is, defining 

p = p' . p" = P f /[(M - 2) x 2 M+1 ] (1) 

it is implicitly asserted that the probability that a catalyst 
catalyzes a given reaction is not independent of the prob- 
ability that it catalyzes another reaction - but why would 
considering a bigger reaction space make catalysis less 
likely? In our view, a more reasonable scenario is to 
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Figure 3 Emergence of a self-sustaining network of reactions in a flow reactor. (A) The squares show critical thresholds for subcritical 
(empty squares) or supracritical (coloured squares) growth of the reaction network as a function of the firing disc (maximum length of 
molecular species in the food set) and the probability P that a species catalyses a specific reaction. The darkness of a square reflects the 
proportion of 100 runs in which the network exceeded one of the following conditions: > 2 x 10 7 reactions or > 10 5 molecular species (note 
that in any finite system the reaction network cannot be explored infinitely due to mass constraints). (£) The crucial parameter P was 
decomposed into its two elementary probabilities: P' (the probability that a species can be catalytic) and P" (the per reaction probability that this 
catalyst catalyses a reaction). When P' decreases P" must be considerably higher for reaction networks to keep growing, but there is a threshold 
above which catalytic networks grow supracritically. (0 Weak inhibition does not prevent formation of large catalytic reaction networks. For 
values of P that do produce catalytic network growth, strong non-competitive inhibition is introduced by choosing with probability /(that a 
species removes another species from the reactor completely if at least one molecule of the inhibiting species exists (this is clearly a worst-case 
assumption). Left: supracritical growth without inhibition. Middle: weak inhibition results in alternating fast and slow growth phases. Right: strong 
inhibition makes the network subcritical. 



assume that P' is defined as it was above, but P" is now 
considered to be the per reaction probability that a catalyst 
catalyzes the reaction. When our previous simulations 
were re- implemented with constant P' and P" values it was 
found that when the ratio of catalysts {J* 9 ) decreases, the 
probability of catalysis (P") must be considerably higher 
for reaction networks to keep growing (Figure 3 A and 3B). 
Therefore, even though there is no known random poly- 
mer chemistry in which these probabilities are ever so 
high as necessary for supracritical growth -certainly not 
random polypeptides [27]-, we conclude that Lifsons [26] 
criticism remains a quantitative one, leaving open the pos- 
sibility that were it possible to obtain such catalysts, the 
catalytic network could still form spontaneously. 



A second objection to the model was that autocataly- 
tic sets could not have been formed spontaneously due 
to the paradox of specificity; that is, a high number of 
molecules is required for spontaneous emergence of a 
self-sustaining network of reactions [1], but the harmful 
effect of side reactions that ought to rise with an 
increasing set calls for a small system size [28]. One way 
to check whether harmful effects of side reactions in 
spontaneously emerging autocatalytic sets could inhibit 
network growth is to introduce strong non-competitive 
inhibition. It is easy to imagine that a species removes 
another species from the reactor by some side reaction, 
and we chose to implement the strictest possible sce- 
nario where one molecule of inhibitor removes the 
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inhibited species completely. In a manner analogous to 
the determination of catalytic reactions, each molecule 
inhibited any of the other molecules with the probability 
K. A species may therefore be both an inhibitor and 
also have other positive catalytic effects. It should be 
noticed that competitive inhibition already emerges in 
the model in the case where a catalyst uses another cat- 
alyst as a substrate, and so it is not necessary to expli- 
citly add this. At high levels of inhibition (e.g. K = 0.01) 
the consequence is to convert what would have been a 
supracritical network to a subcritical one. However, at 
lower levels (e.g. K = 0.001) the effects of poisoning do 
not radically prevent supracritical growth, but the net- 
works grow non-monotonically due to the loss of some 
catalysts because of inhibition (Figure 3C). We therefore 
conclude that inhibition does not qualitatively prevent 
formation of large catalytic reaction networks. To sum- 
marize, the formation of autocatalytic sets is robust 
against the two main criticisms that have been raised 
against the model. 

All the previous simulations assumed that only cata- 
lyzed reactions happen in the flow reactor. Bagley and 
co-authors [14,15] modelled the background of uncata- 
lyzed reactions as spontaneous fluctuations that 
resulted in the rare appearance of autocatalytic sub- 
graphs from the shadow of existing reactions (subset of 
species that can be produced from existing species in 
uncatalyzed reactions), an approach we find proble- 
matic because (i) it already assumes without proof that 
autocatalytic loops are present and (ii) it only allows 
for loops where each step is catalytic, discarding a 
large variety of possible organizations. In order to 
avoid this flaw, we simulated the uncatalyzed reactions 
directly. In our model rare uncatalyzed reactions pro- 
duce random novel species in low copy number from 
the shadow, and if the new molecule happens to be a 
catalyst, it will generate a chemical avalanche of 
directly and indirectly catalyzed further novel molecu- 
lar species. As expected, we found that only those spe- 
cies are able to permanently join the network that 
eventually catalyze their own production from already 
existing molecules and so produce a viable autocataly- 
tic loop (see Figure 2). Such viable loops define a new, 
distinct core within the autocatalytic set. Such a novel 
core is only rarely produced, at least in the small net- 
works we simulated, but the probability of their spon- 
taneous appearance depends on the size of the shadow 
and is expected to increase with network size and P. 
There is, therefore, an intrinsic slow tendency for non- 
food set mass to increase by rare incorporation of 
viable loops in the network that also results in the 
increase of complexity (Figure 4, Additional file 1). 
This appearance of novel cores is a critical property as 
we shall see next. 



Evolvability of chemical networks enclosed in 
compartments 

Our next step was to tackle the issue of evolvability 
when chemical reaction networks are confined into a 
small volume (compartment). Now the question is: what 
is required for chemical networks to undergo Darwinian 
evolution? As emphasized by Ganti [29] and Wachter- 
shauser [30,31], if distinct, organizationally different, 
alternative autocatalytic networks can coexist in the 
same environment then they could compete with each 
other and the 'fittest' would eventually prevail. This is 
obviously a narrow view of what a unit of evolution 
really is [17], but raises the important issue that reaction 
networks must somehow posses multiple attractors and 
transitions among attractors must be possible. As Wes- 
son [32] put it, "the attractor is the essence of self-orga- 
nization. Just what constitutes it and how the organism 
shifts from one attractor to another is a task for genet- 
ics... to elucidate". This message is even clearer in Con- 
way Morris [33], who posits that evolution navigates to 
particular functional solutions (convergence) thus point- 
ing to the existence of something analogous to 'attrac- 
tors' in biological systems. We demonstrate that for a 
catalytic network to accumulate adaptations it needs to 
be compartmentalized, the platonic reaction network 
must have multiple attractors, and some of these attrac- 
tors must be selectable. The larger the number of attrac- 
tors, the smaller the chance of convergence. 

Compartmentalizing the reaction networks enables fil- 
tering out harmful modifications and therefore it is a 
prerequisite for accumulating potential beneficial 'adap- 
tations', as demonstrated in [19]. We modelled compart- 
ments exactly as in Farmer et al. [12]; that is, each 
compartment is a flow reactor in which food is input, 
and materials leak out at first order. The number of 
attractors is itself of interest, for they allow a protocell 
to have multiple pathways of autocatalysis and also to 
show molecular variability to respond to an environ- 
ment. We approximated the number of attractors by fix- 
ing the reaction network and shuffling the chemical 
concentrations (by choosing random pairs of species 
and swapping their concentrations) several times in 
order to sample various initial conditions. After shuf- 
fling, the network dynamics are run for some fixed per- 
iod of time until an attractor is reached. Even if 
multiple attractors exist, stochastic division might not 
generate sufficient variation to allow transition between 
them. To test this, we also simulated the more realistic 
situation where the compartment enclosing the genera- 
tive chemistry was allowed to grow for a fixed period of 
time, after which it was assumed to split into daughter 
compartments, whose molecules were sampled from a 
polyhypergeometric distribution of molecular contents 
in the parental compartment. 
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Figure 4 Persistent increase in non-food set mass due to novel viable loops. We simulated 460 runs lasting 30,000 growth steps each, with 

food set size M = 4, P' = 0.75, P" = 0.0025, K = 0 (without inhibition), but with spontaneous emergence of rare novel species from uncatalyzed 

reactions. 5 out of 460 runs showed persistent increases in non-food set mass (8). This was always due to the incorporation of at least one 

viable loop. {A) Example of viable loop organization used in evolutionary simulations. Solid lines: reactions; dotted lines: catalytic activities. 

Orange dotted lines show the superimposed autocatalytic loop. The original network, on the left side of the blue line, is not shown in detail. 
I ) 



Now we arrive at the critical issue of evolvability, 
which can be first rephrased as the potential of a popu- 
lation of compartmentalized molecular networks with 
different attractors to respond to selection; that is, to 
transit between different attractors according to the fit- 
ness value assigned to each of them. As a preliminary 
test of evolvability, reaction networks were subjected to 
artificial selection. A small population of 10 compart- 
ments was isolated for a fixed generation period. After 
this time the fitness of each compartment, defined as 
the total mass of non-food species present at the end of 
the growth phase just prior to division, was assessed and 
production of the next generation occurred by taking 
molecule propagules from compartments on the basis of 
fitness proportionate selection (roulette-wheel selection 
with elitism [34]). This elitist selection was used to 
always sample at least one propagule from the individual 
with the highest rank in any given generation. 

The results of the artificial selection experiment were 
confirmed in numerical simulations of natural selection. 
Here we assumed an initial population of N = 100 com- 
partments and introduced a classical Moran process 
[35] to test the evolvability of reaction networks when 
subjected to natural selection. Thus, in each time step a 
randomly chosen compartment from the whole popula- 
tion is selected for growing at a rate which is a function 
of its chemical composition. The compartment is 
returned to the population whenever its size is less 
than 77 molecules and the step ends. If, however, size 
reaches 77, the compartment generates two daughter 



compartments by creation of two propagules. One off- 
spring replaces the parent compartment and the other a 
randomly chosen one from the population. In this sto- 
chastic process, the total number of compartments 
remains constant and given by N, but compartment's 
size can fluctuate between the propagule size and 77 
molecules. Selection for a specific target was implemen- 
ted by multiplying the rates of all reactions by a selec- 
tive advantage S if it matched the characteristic 
composition of the desired attractor. 

We tested the evolvability in all three previously 
described models - the original Farmer-type autocataly- 
tic sets, networks with inhibition, and networks with 
random novel species produced by uncatalyzed reactions 
- according to the principles described above. In the 
case of the original networks [12] the results were 
straightforward: they always have only one attractor 
(Additional file 1) and selection is not possible. This was 
not surprising considering that these networks contain 
only one autocatalytic core (Additional file 1). Therefore, 
the conclusion immediately follows from our previous 
considerations: Kauffman's [11] original polymer chem- 
istry when enclosed in a finite space will eventually crys- 
tallize into the same attracting network which can never 
ever be a Darwinian unit. 

Interestingly, this behaviour is analogous to concep- 
tually similar models' [4] where the whole catalytic net- 
work forms inevitably only one viable core and so 
ultimately converges to only one attractor. Therefore, 
one important conclusion to be derived from our work 
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is that we can definitively discard all autocatalytic net- 
works discussed so far in the literature as units of evolu- 
tion in the Darwinian sense, with the possible exception 
of [19]. 

However, it has been suggested [1] that the inclusion 
of inhibition in the Farmer-type network should permit 
the formation of autocatalytic sets having complex dyna- 
mical attractors. To determine if this is so we also run 
simulations introducing strong non-competitive inhibi- 
tion as indicated above. Interestingly, our results sub- 
stantiate this speculation because molecular networks 
now exhibited multiple attractors, but when the growth- 
splitting process was implemented spontaneous transi- 
tions between them were rare. When transitions did 
occur, they happened either periodically or chaotically 
(Additional file 1). Rather surprisingly, the artificial 
selection experiment excluded networks with inhibition 
from candidates of units of evolution, since the popula- 
tion typically settled down into one equilibrium or fluc- 
tuated stochastically or periodically between attractors 
and so attractors typically could not be stably selected 
(Additional file 1). Instead, the internal dynamics of the 
growth-splitting process completely overrode any effect 
of selection. This provides a clear counter-example to 
the widely accepted claim that the existence of multiple 
attractors is sufficient to allow selectability; it is not. 

The crucial modification to the model was to allow 
rare novel species to appear from the shadow. In the 
few cases of networks in which spontaneous addition of 
new species resulted in the ignition of a novel viable 
loop, and thus novel cores, there always existed multiple 
attractors (see Figure 2. for a didactic example and Fig- 
ure 4 for the network used in the evolutionary simula- 
tions). Note that we did not simulate inhibitory 
reactions in this version of the model; while they are 
certainly relevant in applications closer to real chemis- 
try, their inclusion would have made our results on 
viable loops more difficult to interpret. Analogous to the 
idea that 'attractors' in biological systems have different 
stabilities (i.e., convergence can be equated with the 
revisiting of most stable attractors [33]), we also 
detected stable attractors (with a larger attractor basin) 
where the system settles most of the time with occa- 
sional transitions to less stable attractors (with smaller 
attractor basins). 

We intuitively suspected that selection would work in 
networks with novel viable loops, and this was indeed 
the case. Our results can be summarized as follows: 
while networks with the viable core have an implicit 
selective advantage due to their higher growth rate, and 
so constitute the majority of the population, a one per- 
cent selective advantage attributed to the absence of the 
core is enough to significantly reduce the proportion of 
networks with viable cores in the population (Figure 5). 



The reason for the selectability in this model is that a 
novel viable core results in a new and distinct attractor 
for the reaction network, and due to its autocatalytic 
properties enables a higher (non-food mass) growth 
rate. Hence, we already have the basic requirements for 
natural selection to happen: two entities that are grow- 
ing exponentially at different rates and have different 
division times [36]. Since it is always possible to lose the 
viable core upon protocell fission (a loss mutation that 
is simply a function of propagule size), there is a kind of 
mutation-selection' balance if no novel chemical species 
can invade from the shadow. When rare reactions are 
allowed, novelty can arise by generation of new viable 
cores, and they can be removed by selection if they 
reduce the growth rate of the compartment. Between- 
compartment selection as shown in Figure 5 arises due 
to the effect a core has on the compartment level fit- 
ness. For example, the large core (Figure 4) sustains 
more non-food mass in its core and its periphery, and 
this increases the growth rate of the compartment. In 
reality, each molecule of the core and its periphery may 
confer a host of compartment level effects, e.g. modifica- 
tion of permeability of the membrane, specific metabolic 
adaptations, etc. that could have a compartment level 
fitness effect, but this is not explicitly modelled here. 

It is important to note that there are two levels of 
autocatalysis in this system. Even if the internal organi- 
zation of the network encapsulated by a protocell fails 
to be autocatalytic, the rule that after reaching the criti- 
cal mass the compartment divides into two effectively 
ensures that such compartments will have a 'generation 
time' and the potential to grow exponentially. Also auto- 
catalytic cores grow exponentially. Hence there is auto- 
catalysis at two levels: the level of molecules and that of 
compartments. The reproducing compartment without 
an enclosed autocatalytic network is not, however, a 
replicator, as it always assumes the same state and can- 
not sustain hereditary variation [28]. 

Conclusions 

It should come as no surprise, our finding that indepen- 
dent viable autocatalytic cores embedded in a large 
molecular network can be considered as units of evolu- 
tion, since the basic ingredients of differential growth 
rates and division times among potentially competing 
entities are fulfilled. The reader might, however, query 
that we have not properly addressed the issue of evolva- 
bility because no mention of heredity has been made. 
To fully understand that this criticism does not apply 
here it is important to appreciate the implications of the 
core-periphery dichotomy in autocatalytic sets. This 
dichotomy can be translated into a kind of genotype- 
phenotype mapping in fully fledged biological systems 
and, interestingly, allows us to appropriately use the 
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Figure 5 Selectability of potentially coexisting attractors in a molecular network. Each dot corresponds to a compartment just prior to 
division. (Top) Due to its autocatalytic properties a viable loop enables a higher growth rate and therefore the network with the large viable 
loop (characterized by 26 reactions and dividing after approximately 20 000 time steps) constitutes the most frequent network type. 
Spontaneous reaction rate = 0.00001. Propagule size 800; no selection (S = 1). {Bottom) However, with a mere 1 percent fitness advantage (5 = 
1.01) attributed to the networks without the loop, it is possible to reduce its frequency. In this case the original network without any viable 
loops is the most frequent. 



terms replication and reproduction despite the fact that 
we are dealing with an assembly of molecules [37]. 
Thus, the viable cores could be considered the units 
that replicate and, once transmitted to the offspring 
compartments after the parental compartment splits 
(reproduces), they give raise to the same periphery; that 
is; there is a clear matching between a viable core ('gen- 
otype') and the periphery (phenotype'). 'Mutation' hap- 
pens either when uncatalyzed reactions result in the 
emergence of a novel core, or when molecular compo- 
nents of a viable core are stochastically lost after com- 
partment splitting. Thus, our autocatalytic networks are 
capable of stably transmitting information across gen- 
erations [17]. 

However, a viable core constitutes one bit of heritable 
information and therefore the number of possible select- 
able attractors is relatively small, meaning that autocata- 
lytic networks may not be able to sustain open-ended 
evolution. While we think this to be the case, the poten- 
tial role of these autocatalytic networks as a route to 
nucleotide-based template self-replicating systems 
should not be underestimated. The chemical reaction 



networks show an intrinsic tendency to increase in com- 
plexity. Whenever novel spontaneous reactions occur, 
the number of possible uncatalyzed reactions also 
increases, opening up new possibilities for discovering 
viable cores (genotypes) and their corresponding periph- 
eries (phenotypes). This 'cooptive evolution' [13] 
involves stepwise expansions into (and retractions from) 
the adjacent possible of reaction space [38]. 

It is important to note that we do not claim that the 
present work renders the RNA world obsolete. In fact 
several of the authors of the present paper have worked 
under the assumption that indeed there had once been 
an RNA world. But this does not mean two things: first, 
that the RNA world was "clean" (probably it was not; 
other molecules, large and small, are likely to have been 
around and to have served even key functions), second, 
that reflexively autocatalytic networks could not have 
preceded the RNA world (they may have been indispen- 
sable "scaffolds", sensu Cairns-Smith, [39] for its appear- 
ance). As suggested by Dyson [10], RNA could have 
entered as a kind of waste/parasite, but already under 
the catalytic influence of evolved (as we argue) 
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autocatalytic networks. Genetic takeover does not in 
principle require reverse translation or any similar eso- 
teric process; one just needs room for stepwise innova- 
tion and improvement. 

Our results should be contrasted with the lipid world 
scenario [4,16,40] that so far has failed to offer models 
that would demonstrate a capacity for evolvability. The 
problem is that the simplicity of the underlying chemis- 
try in GARD (lipid molecules are either in the assembly 
or not) allows only as many reactions as there are differ- 
ent molecular species available in the environment. 
Moreover, the number of distinct lipid types cannot be 
too high, partly because of practical considerations, but 
also because increasing diversity implies increasing noise 
in compotype replication. The restricted diversity of 
molecules and reactions means that the system will 
always quickly converge to the state set in stone by the 
underlying dynamical equations [4]. The only possibility 
left open for change is the addition or removal of lipid 
species. Pointing out that altering the food set of a reac- 
tion network modifies its dynamics, however, has no 
relevance for evolution. A combinatorial chemistry like 
the polymer chemistry described in this article, on the 
other hand, provides an unlimited diversity of theoreti- 
cally possible reactions originating from the same food 
set and a reasonable probability that a reaction network 
can discover novel cores in its shadow. Also, the perma- 
nent incorporation of a new core will extend the sha- 
dow, opening up new possibilities. Therefore we argue 
that such a combinatorial chemistry (or one with similar 
complexity) is essential for even limited evolution. The 
complexity of a lipid world is overshadowed by the pos- 
sibilities enabled by the outlined polymer chemistry, 
which itself is only a shadow of the world of template- 
replicating nucleic acids. 

We stress that there is a crucial difference between 
small-molecule autocatalytic cycles (such as the reduc- 
tive citric acid cycle) and reflexively autocatalytic sets of 
polymers. First, a family of polymers (such as proteins) 
can be synthesized by a small set of canonical chemical 
reactions, whereas the reductive citric acid cycle consists 
of chemical steps of various kinds (cf. Orgel [41]), thus 
the former can more readily be catalyzed by environ- 
mental (i.e. unevolved) catalysts. Second, and more 
important, polymers can, due to their modular construc- 
tion, show targeted and specific activity in catalytic task 
space. The increased efficiency of catalysis carries over 
to resistance against side reactions [41] that constantly 
divert material from useful pathways. However, these 
facts merely change the probability of formation of 
viable cores in particular chemical systems, not the fact 
that Darwinian evolution is possible once they appear. 

The remaining open issues are experimental and theo- 
retical in nature. We need better models and, above all, 



relevant experiments. The systematic consideration of 
the experimental realization and evolvability of autocata- 
lytic networks of small organics (such as those of inter- 
mediate metabolism [2,19,29,31]) require further 
scrutiny in the light of the proposed selectability princi- 
ples. It is not farfetched to claim that an empirical 
scientific program implementing the sort of simple 
chemistry used in these models is worth pursuing. The 
recent calculations of Amend and McCollom [42] indi- 
cate that amino acid production in ancient hydrother- 
mal vents could have been thermodynamically favoured, 
providing a continuous supply of monomers for the 
hypothesized peptide network. Autocatalytic networks of 
peptides already have been synthesized by Ghadiri and 
Ashkenazy [21], although there a direct templating effect 
plays a crucial role. Protein networks that do not 
employ templating are more difficult to realize, but sev- 
eral recent advances hint they might be possible. There 
exists a dipeptide that does catalyze ligation of peptides 
[43] . Random peptides of length about 32 and 74, biased 
to known ratios of amino acids in evolved proteins, fold 
into compact structures for 30% of such sequences [44], 
and longer random polypeptides have shown to have 
catalytic activity [45]. The folding and catalytic proper- 
ties of "never before born" peptides is therefore an open 
experimental question that could be addressed with ran- 
dom peptide libraries - it is a project much to be 
sought, and we should make it clear that these experi- 
ments are now needed, and hopeful given the promising 
results cited above. 

Not every aspect of a key proposal for the sponta- 
neous emergence of dynamical chemical organizations 
can be scrutinized in a single paper. Here we restricted 
ourselves to three issues: (i) the probability of the 
nucleation of reflexively autocatalytic networks, as ques- 
tioned e.g. by Lifson [26], (ii) the side reaction problem, 
as raised by Orgel [41] and Szathmary [28], and (iii) the 
question of Darwinian evolvability of autocatalytic poly- 
mer sets, as left open by the previous investigations by 
Bagley et al. [14,15]. We think we have advanced pro- 
misingly with all three problems in the present work. 
Our work shows that autocatalytic sets as first devised 
by Dyson [9,10] and Kauffman [1,11] are theoretically 
possible despite previous criticisms and, perhaps more 
interesting, that chemical evolution in these systems can 
lead to the appearance of viable autocatalytic cores, thus 
opening the possibility for evolution by natural selection. 
We have used an abstract chemistry not to avoid real 
chemistry, but to seek general principles. For example, 
selection between autocatalytic cores may even be a pos- 
sibility in combinatorial inorganic systems evolving in 
iCHELL compartments [46]. Naturally, one cannot be 
satisfied with abstract toy chemistry for long. But there 
is always a first step, and the scenario outlined here 
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should give us hope that it is worthwhile to explore the 
idea further. After all, the pre-template Darwinian 
dynamics of rare core production and selection 
described here - fundamentally different from the 
mechanism advocated by Kauffman [1] and dismissed by 
Eigen [18] - is the only viable proposal so far for how 
autocatalytic reaction networks could accumulate 
adaptations. 

Methods 

Following Farmer et al. [12] a simplified model of cata- 
lyzed linear polymer reactions was devised consisting of 
polymers of alphabet size B = 2. Reversible, ligation 
(condensation) and cleavage reactions, catalyzed by 
another polymer, were modelled. Reactions were of the 
form: 

Ri + R 2 ^ Pi ( 2 ) 

where Ri and R 2 are the concentrations of reactants, 
Pi is the concentration of the product of a condensation 
reaction, and c is the catalyst. No attempt was made to 
relate the structure (i.e. sequence) of a polymer to its 
catalytic or reactive function. Each molecule has a cer- 
tain probability P of catalyzing each theoretically possi- 
ble ligation-cleavage reaction, and the reactions in 
which a polymer will participate, along with the cata- 
lysts, are determined randomly. 

The chemical kinetics approximates the behaviour of 
catalyzed reactions enclosed in a compartment. We 
assumed identical binding velocity for all intermediates. 
Only the catalytic rates vary in magnitude from 10 to 
1000. The food set contained polymers up to length M 
= 4. Food was present at initial concentration F c and 
added continuously at rate Finput* and all molecular spe- 
cies have a first order decay rate kj. We typically used 
10" 5 M as a minimum concentration threshold below 
which no molecule of the species exists in the reactor 
(for more details see Additional file 1). 

In order to address Lifson's criticism [26], the para- 
meter P was replaced by two independent probabilities: 
the probability P* that the molecule is a catalyst, and the 
probability P" that a molecule catalyzes a given reaction. 
In models including inhibition, when a new molecular 
species is first produced (in a manner analogous to the 
determination of catalytic reactions) it is determined 
which other species in the reactor the new species will 
inhibit with probability 7<T, and which existing species 
will poison the new species also with probability K. The 
probability of inhibition varied from 0 to 1. In the novel 
species model uncatalyzed reactions among available 
molecular species took place with a low probability (the 
concentration of any molecular species had a probability 
ranging from 0 to 0.01 to increase over threshold), 



producing random novel species in low copy number 
from the shadow. 

The three versions of the chemical model (original, 
inhibition and with novel species), all confined into 
compartments, were subjected to the same tests in 
order to assess their evolvability. We approximated the 
number of attractors in chemical networks by fixing the 
reaction network and shuffling the chemical concentra- 
tions (by choosing random pairs of species and swap- 
ping their concentrations) several times in order to 
sample various initial conditions. After shuffling, the 
network dynamics are run for 1000 time steps that 
appeared enough to reach a (new) attractor. Even if 
multiple attractors exist, stochastic division might not 
generate sufficient variation to transition between them. 
To test this, compartments enclosing the generative 
chemistry were allowed to grow for a fixed period of 
time after which compartment splitting was modelled by 
taking a polyhypergeometric distribution of molecular 
contents. Typically the propagule size was 500 mole- 
cules to allow sufficient variability, also permitting loss 
of species upon division. In addition, a low probability 
of spontaneous appearance in low copy number of spe- 
cies that already exist in the platonic reaction network 
(but have gone below threshold concentration) was 
assumed, allowing the re-emergence of lost species. 

In the artificial selection experiment a population of 
10 compartments was isolated for a fixed generation 
period. After this time the fitness of each compartment 
was assessed and production of the next generation of 
10 new compartments occurred by taking 800 molecule 
propagules from compartments on the basis of fitness 
proportionate selection; i.e. compartments were chosen 
to populate the next generation in proportion to their 
fitness by using roulette-wheel selection with elitism 
[34]. Elitism was used to always sample at least one pro- 
pagule from the individual with the highest rank in any 
given generation. Two different targets of selection were 
imposed. First, fitness of a compartment was defined as 
the total mass of non-food species (sum of the concen- 
tration multiplied by length for all non-food species) 
present in a compartment at the end of the growth 
phase, just prior to division. Second, the fitness was 
defined as the reciprocal of this value. As the last step, 
the response of the chemical networks to natural selec- 
tion was tested in numerical simulations. We introduced 
a classical Moran process [35] with the following para- 
meters: propagule size was 800 molecules, division 
occurred at the compartment size (77) of approximately 
half of the equilibrium size (approx. 2.2 mass units of 
non-food; mass is calculated as concentration multiplied 
by length) and the population size (N) was set to 100 
compartments. Selection for a specific target was imple- 
mented by multiplying the rates of all reactions by a 
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selective advantage S (set to 1.01 or 1.1) if it matched 
the characteristic composition of the desired attractor. 
The lack of core were defined as none of its members 
present at concentration > 0.02. 

Additional material 



Additional file 1: Spontaneous Formation and Evolution of 
Autocatalytic Sets within Compartments. This document provides 
information on methodological details, gives examples of autocatalytic 
loops, and discusses results that were only briefly mentioned in the main 
text. 
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Reviewer's comments 

Reviewer 1: Prof. William Martin 

Reviewer's comments: This is a thoughtful but narrowly focussed paper, that is 
possibly too far removed from real chemistry and real metabolism in general 
design to be suited to BD. Overall, though, I think that the authors have 
presented a contribution worth publishing. My main complaint would be that 
they overlook the possibility of prexisting inorganic catalysts that are not part of 
their reaction products and the possibility that things chemical fall into place 
more along the lines of thermodynamics than along the lines of mathematics. 
Authors' response: Indeed, we used chemistry more for inspiration than for 
the actual derivation of model parameters. The main reason for this narrow 
focus was that we found the mathematical foundations of selectability in 
reaction networks utterly confused. How could we study which real 
chemistry could produce evolvable units without understanding what 
exactly makes a reaction network selectable? Naturally, our work is just a first 
step towards a more rigorous study of evolvable networks; and next we 
must understand the kind of chemistry that could realize distinct 
autocatalytic cores. We are planning to implement a more realistic rendering 
of chemistry, one based on affinity dependent on pattern matching, in the 
near future. It is indeed very important to increase chemical realism in such 
models, though this needs large computational resources. 
Reviewer's comments: But the network concept is surely useful here. I cannot 
see how Oparin fits into this and I am not convinced that this paper really 
addresses coacervates in any way. 

Authors' response: The idea of coacervates is not relevant for the paper, only 
Oparin's once widely accepted, and now widely debated idea of a self- 
assembling, self-reproducing, compartmentalized proto-metabolism. 
Reviewer's comments: One misses references to Mike Steel's papers on the 
mathematics of this topic. 

Authors' response: This mistake has been corrected. 

Reviewer's comments: It is excessive to mention Kauffmann's name 20 times 
in the text, evoking the impression that what was written in the literature is 
a good deal less important than who wrote it. 
Authors' response: We agree, and we modified the text accordingly. 
Reviewer's comments: In the practical sense of actually talking about 
biochemical origins, the authors seem too concerned about polymerization in 
my view, they should be more concerned about the source of the 
monomers. In that respect, papers by Harold Morowitz should be consulted, 
for example (Morowitz, H. J., J. D. Kostelnik, J. Yang, and G. D. Cody. 2000. The 
origin of intermediary metabolism. Proc. Natl. Acad. Sci. USA 97:7704-7708), 



Authors' response: We are less enthusiastic about the Morowitz et al. paper. 
First, the primitive ancestry of the reductive citric acid cycle has been 
forcefully advocated by Gunter Wachtershauser before. Second, we still do 
not have evidence for a non-enzymatically running such cycle (pyruvate 
carboxylation seems to be an obstacle). Third, we do have evidence for a 
non-enzymatic autocatalytic metabolic cycle: the formose reaction. For the 
time being we prefer not to be too dogmatic about concrete chemical 
scenarios. Ultimately it is an experimental issue. 
Reviewer's comments: as should the chapter: 

Amend, J. P., and T. M. McCollom. 2009. Energetics of biomolecule synthesis 
on early Earth. In "Chemical Evolution II: From the Origins of Life to Modern 
Society" American Chemical Society, Chapter 4, pp 63-94. 
where they will be surprised to find much information that strongly 
supports portions of their case, if they can bear the thought of what 
geochemical environment is involved. 

Authors' response: We are grateful for the reviewer for pointing out the 
chapter to us. It is now referenced in the article. It is indeed reassuring to 
see that the Gibbs free energy for the synthesis of amino acids is very 
favourable at natural sites like the Lost City. We are fully aware that 
geochemical realism must ultimately be an integral part of the whole story. 
We have taken the path of classical theoretical biology: if certain 
assumptions are made, what are the consequences for evolutionary 
dynamics. Ever since the work of Manfred Eigen in 1971, students of the 
origin of life must bear the thought of what selective forces could have 
been involved. 

Reviewer 2: Dr. Eugene V. Koonin 

This article addresses a question of extreme interest and importance, in a 
way the central question in the origin of life field: is adaptive evolution 
possible without replicating template polymers, in autocatalytic networks of 
chemical reactions? A compelling positive answer to this question will 
effectively be a solution (at least in outline) to the origin of life problem, i.e. 
a plausible path from a mix of simple organic molecules to a replicating 
system of modern type. Incidentally, this solution will also imply that there 
was no RNA World at the beginning of life. The question is by no means 
new, it has been addressed by many, starting apparently with Oparin who 
realized the problem clearly, even if only at the verbal level, and including 
some of the authors of this article who concluded in previous work that 
autocatalytic networks in general lack the evolvability potential. The 
outcome of the model and simulations described in the present article is 
different: the authors conclude that evolvability is characteristic of 
autocatalytic chemical networks if they have a particular structure and exist 
in networks of compartments. The new principle that emerges in this work 
is that of the separation of chemical networks into core and periphery. The 
small stable core consists of multiple autocatalytic loops whereas the 
periphery includes non-autocatalytic reactions and is generated by the core. 
In a sense, the core is analogous to the genotype whereas the periphery is 
an analog of the phenotype. In the simulations presented in the paper, this 
system displays bona fide Darwinian selection as well as drift: new cores 
emerge by 'mutation' and elements of cores can be stochastically lost. 
I have no reasons to question the validity of the results within the model 
and the chosen parameters. The limitations of the work that the authors 
fully realize are that: i) the model is purely abstract and does not include 
any real chemistry, ii) even if one accepts the possibility of replication of 
autocatalytic networks, no path to template-based replication is provided let 
alone the key genotype-phenotype link via translation. I would add that to 
me, the central problem is the stability of such a putative non-templated 
system of replicators in real time: is it all sufficient to provide the 
springboard for evolution leading to modern-type life forms? There is 
currently no answer to this key question, and in my opinion, such an answer 
cannot be reached at the level of abstraction adopted in the present work. 
However, the authors of the paper notice absolutely correctly that all 
problems in such a broad and extremely challenging field as the origin of 
life cannot be possibly addressed in a single paper. In my opinion, this 
stimulating paper is an important step in a most important direction. 
Authors' response: An understanding of the kinds of chemical organization 
that could sustain heredity is logically anterior to the problem of the 
stability of such organizations. We are planning to investigate the selective 
origin of template replication within such a framework in the near future. 
We also think that the origin of the genetic code is a considerably later step 
(as advocated by one of the authors): an interim (albeit dirty) RNA world 
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must have prevailed, simply as a consequence of the opportunistic nature of 

evolution by natural selection. 

Reviewer 3: Dr. Anthony Poole 
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